Consistent Risk Estimation in Moderately High-Dimensional Linear Regression

Abstract

Risk estimation is at the core of many learning systems. The importance of this problem has motivated researchers to propose different schemes, such as cross validation, generalized cross validation, and the bootstrap. The theoretical properties of such estimators have been extensively studied in low-dimensional settings, where the number of predictors p is much smaller than the number of observations n. However, a unifying methodology accompanied by a rigorous theory is lacking in high-dimensional settings. This paper studies the problem of risk estimation under the moderately high-dimensional asymptotic setting n, p → ∞ and n/p → δ > 1 (δ is a fixed number), and proves the consistency of three risk estimates that have been successful in numerical studies, i.e., leave-one-out cross validation (LOOCV), approximate leave-one-out (ALO), and approximate message passing (AMP)-based techniques. A cornerstone of our analysis is a bound that we obtain on the discrepancy of the `residuals' obtained from AMP and LOOCV. This connection not only enables us to obtain more refined information on the estimates of AMP, ALO, and LOOCV, but also offers an upper bound on the convergence rate of each estimator.
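
As a small illustration of the leave-one-out risk estimate named in the abstract, the sketch below computes LOOCV for ordinary least squares in two ways: by refitting n times, and through the classical leverage identity e_i / (1 - h_ii). It is not the paper's ALO or AMP estimator. The choice of plain least squares, the Gaussian design, the sizes n = 60 and p = 20 (so n/p = 3 > 1), and all function names are assumptions made only for this example.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gram(X):
    """Return X^T X for a design matrix stored as a list of rows."""
    p = len(X[0])
    return [[sum(r[j] * r[k] for r in X) for k in range(p)] for j in range(p)]

def ols(X, y):
    """Least-squares coefficients from the normal equations."""
    p = len(X[0])
    rhs = [sum(r[j] * yi for r, yi in zip(X, y)) for j in range(p)]
    return solve(gram(X), rhs)

def loocv_brute(X, y):
    """Refit on n - 1 points for each held-out observation."""
    n = len(y)
    total = 0.0
    for i in range(n):
        beta = ols(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        total += (y[i] - dot(X[i], beta)) ** 2
    return total / n

def loocv_shortcut(X, y):
    """Single fit; leave-one-out residual is e_i / (1 - h_ii) for OLS."""
    G = gram(X)
    beta = ols(X, y)
    total = 0.0
    for xi, yi in zip(X, y):
        h = dot(xi, solve(G, xi))  # leverage h_ii = x_i^T (X^T X)^{-1} x_i
        total += ((yi - dot(xi, beta)) / (1.0 - h)) ** 2
    return total / len(y)

def simulate(n=60, p=20, noise=0.5, seed=0):
    """Hypothetical Gaussian design with n/p = 3, i.e. delta > 1."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    beta = [rng.gauss(0, 1) for _ in range(p)]
    y = [dot(row, beta) + noise * rng.gauss(0, 1) for row in X]
    return X, y

X, y = simulate()
risk_brute = loocv_brute(X, y)
risk_fast = loocv_shortcut(X, y)
print(risk_brute, risk_fast)
```

For least squares the two numbers agree up to floating-point error, since the leverage identity is exact in that case. For regularized or non-quadratic losses no such exact identity is available in general, which is the setting where approximate schemes such as ALO are of interest.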

Related resources

Consistent group selection in high-dimensional linear regression.

In regression problems where covariates can be naturally grouped, the group Lasso is an attractive method for variable selection since it respects the grouping structure in the data. We study the selection and estimation properties of the group Lasso in high-dimensional settings when the number of groups exceeds the sample size. We provide sufficient conditions under which the group Lasso selec...

Pivotal Estimation in High-dimensional Regression via Linear Programming

We propose a new method of estimation in high-dimensional linear regression model. It allows for very weak distributional assumptions including heteroscedasticity, and does not require the knowledge of the variance of random errors. The method is based on linear programming only, so that its numerical implementation is faster than for previously known techniques using conic programs, and it all...

Package hdlm: Regression Tables for High Dimensional Linear Model Estimation

We present the R package hdlm, created to facilitate the study of high dimensional datasets. Our emphasis is on the production of regression tables and a class ‘hdlm’ for which new extensions can be easily written. We model our work on the functionality given for linear and generalized linear models from the functions lm and glm in the recommended package stats. Reasonable default options have ...

Robust High-Dimensional Linear Regression

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of trainin...

Multiple testing in high-dimensional linear regression

In many real-world statistical problems, we observe a large number of potentially explanatory variables of which a majority may be irrelevant. For this type of problem, controlling the false discovery rate (FDR) guarantees that most of the discoveries are truly explanatory and thus replicable. In this talk, we propose a new method named SLOPE to control the FDR in sparse high-dimensional linear...

Journal

Journal title: IEEE Transactions on Information Theory

Year: 2021

ISSN: 0018-9448, 1557-9654

DOI: https://doi.org/10.1109/tit.2021.3095375